Informed source separation of orchestra and soloist using masking and unmasking
نویسندگان
چکیده
A novel technique of unmasking to repair the degradation in sources separated by spectrogram masking is proposed. Our approach is based on explicit knowledge of the musical audio at note level from a score-audio alignment, which we termed Informed Source Separation (ISS). Such knowledge allows the spectrogram energy to be decomposed into note-based models. We assume that a spectrogram mask for the solo is obtained and focus on the problem of repairing the degraded audio. We evaluate the spectrogram as well as the harmonic structure of the music: we either search for unmasked (orchestra) partials of the orchestra to be transposed onto a masked (solo) region or reshape a solo partial with phase and amplitude imputed from unmasked regions. We describe a Kalman smoothing technique to decouple the phase and amplitude of a musical partial that enables the modification to the spectrogram. Audio examples from a piano concerto are available for evaluation.
منابع مشابه
Informed Source Separation of Orchestra and Soloist
A novel technique of unmasking to repair the degradation in sources separated by spectrogram masking is proposed. Our approach is based on explicit knowledge of the musical audio at note level from a score-audio alignment, which we termed Informed Source Separation (ISS). Such knowledge allows the spectrogram energy to be decomposed into note-based models. We assume that a spectrogram mask for ...
متن کاملImproved Extraction-Separation of Lanthanum/ Europium Ions by Bis(2-ethylhexyl)phosphoric Acid Using 12-Crown-4 as an Ion Selective Masking Agent
The extraction-separation of lanthanum and europium ions with bis(2-ethylhexyl) phosphoric acid (DEHPA) in cyclohexane was improved by adding 12-crown-4 (12C4) to the aqueous phase as a masking agent. In the presence of this crown ether the extraction curve of lanthanum ions versus pH shifts to the higher pH region, while the curve is not influenced for the europium ions. Consequently the s...
متن کاملThe role of perceived spatial separation in the unmasking of speech.
Spatial separation of speech and noise in an anechoic space creates a release from masking that often improves speech intelligibility. However, the masking release is severely reduced in reverberant spaces. This study investigated whether the distinct and separate localization of speech and interference provides any perceptual advantage that, due to the precedence effect, is not degraded by ref...
متن کامل"Sparsification" of Audio Signals Using the MDCT/IntMDCT and a Psychoacoustic Model - Application to Informed Audio Source Separation
Sparse representations have proved a very useful tool in a variety of domain, e.g. speech/music source separation. As strictly sparse representations (in the sense of l) are often impossible to achieve, other ways of studying signals sparsity have been proposed. In this paper, we revisit the irrelevance filtering analysis-synthesis approach proposed in (Balazs et al., IEEE Trans. ASLP, 18(1), 2...
متن کاملSpatial unmasking of birdsong in human listeners: energetic and informational factors.
Spatial unmasking describes the improvement in the detection or identification of a target sound afforded by separating it spatially from simultaneous masking sounds. This effect has been studied extensively for speech intelligibility in the presence of interfering sounds. In the current study, listeners identified zebra finch song, which shares many acoustic properties with speech but lacks se...
متن کامل